A Survey on Balanced Data Clustering Algorithms
نویسنده
چکیده
A R T I C L E I N F O A B S T R A C T Article history: Received 25 March 2017 Received in revised form 27 March 2017 Accepted 29 March 2017 Available online 31 March 2017 In an era of data increasing at a proportion larger than ever, analysis of such data covering all businesses and sectors becomes all the more important. Data mining tasks, particularly clustering, aim at partitioning bulks of data into similar groups/clusters such that data points belonging to one cluster are semantically different from the data points belonging to different clusters. While the clustering algorithms proposed till date for enabling data searching, sorting, analysis, updation and more are numerous; the recent consideration is to achieve balance in clusters. Balance can be achieved in terms of size, importance, density and many other aspects. Balance also enhances the clustering performance of datasets. This paper surveys some of the popular balanced clustering algorithms.
منابع مشابه
Application of modified balanced iterative reducing and clustering using hierarchies algorithm in parceling of brain performance using fMRI data
Introduction: Clustering of human brain is a very useful tool for diagnosis, treatment, and tracking of brain tumors. There are several methods in this category in order to do this. In this study, modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) was introduced for brain activation clustering. This algorithm has an appropriate speed and good scalability in dealing ...
متن کاملارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها
Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...
متن کاملA Comparative Study of Some Clustering Algorithms on Shape Data
Recently, some statistical studies have been done using the shape data. One of these studies is clustering shape data, which is the main topic of this paper. We are going to study some clustering algorithms on shape data and then introduce the best algorithm based on accuracy, speed, and scalability criteria. In addition, we propose a method for representing the shape data that facilitates and ...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملAn improved opposition-based Crow Search Algorithm for Data Clustering
Data clustering is an ideal way of working with a huge amount of data and looking for a structure in the dataset. In other words, clustering is the classification of the same data; the similarity among the data in a cluster is maximum and the similarity among the data in the different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...
متن کامل